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RESOLUTION  OF  THE  SPATIAL  DISTRIBUTION 
OF  THE  BACTERIOPHAGE  MS2  PROTEIN  AND  RNA  COMPONENTS 


1.  INTRODUCTION 

Bacteriophage  MS2  is  a  275A  RNA  virus  that  infects  male  Escherichia  coli 
(Stockley  et  al.,  1994).  Because  of  its  small  size,  relatively  simple  composition,  and  ease  of 
growth,  MS2  is  used  as  a  model  organism  for  a  number  of  macromolecular  (Peabody  and 
Al-Bitar,  2001;  Stockley  et  al.,  1994).  A  great  deal  is  known  about  the  MS2  bacteriophage.  Its 
complete  genome  has  been  sequenced  (Fiers  et  al.,  1976).  The  3,569  nucleotide  genome 
encodes  a  coat  protein,  a  maturation  protein  (or  A  protein),  a  replicase  subunit,  and  a  lysis 
protein  (Atkins  et  al.,  1979;  Fiers  et  al.,  1976).  The  MS2  coat  protein  is  the  primary  structural 
component  of  the  MS2  protein  shell.  In  addition  to  this  function,  it  binds  to  the  MS2  operator 
site  and  acts  as  a  translation  repressor  of  transcription  of  the  MS2  replicase  cistron.  The  A 
protein  (or  maturation  protein)  has  been  shown  to  be  involved  in  attachment  to  the  bacterial 
pilus,  replication,  RNA  packing  and  infectivity  in  vivo.  The  replicase  and  lysis  proteins  are 
involved  in  replication  and  the  lysis  of  the  E.  coli  bacteria,  respectively. 

The  MS2  virion  is  comprised  of  three  components:  the  coat  protein  (relative 
molecular  weight  Mr  =  13,700),  the  A  protein  (relative  molecular  weight  Mr  =  44,000),  and  a 
single-stranded  RNA  molecule.  The  3-dimensional  structure  of  the  intact  virion  has  been 
determined  and  refined  at  2.8A  resolution  (Golmohammadi  et  al.,  1993;  Valegard  et  al.,  1991; 
Valegard  et  al.,  1990;  Valegard  et  al.,  1986).  From  crystallographic  analysis,  the  MS2  virion  is 
thought  to  be  composed  of  90  coat  protein  homo-dimers  arranged  in  a  quasi-equivalent  T  =  3 
lattice  to  form  the  icosahedral  capsid  shell  of  the  type  described  by  Caspar  and  Klug,  1962.  In 
the  capsid,  coat  protein  dimers  are  thought  to  adopt  two  possible  non-covalent  quasi-equivalent 
arrangements,  A/B  and  C/C.  The  A  and  C  subunits  interact  at  the  quasi  6-fold  axes,  while  the 
B-type  subunits  interact  at  the  5-fold  axes.  Structurally,  the  primary  difference  between  these 
conformers  lies  in  the  position  of  the  FG  loop  region  of  the  protein.  In  the  A  and  C  subunits,  the 
FG  loop  is  extended,  while  in  the  C  subunit  it  is  folded  back  in  the  direction  of  the  protein 
(Valegard  et  al.,  1991;  Valegard  et  al.,  1990;  Valegard  et  al.,  1986).  In  addition  to  the  coat 
protein  dimers,  the  MS2  capsid  contains  a  single  copy  of  the  A  protein.  The  A  protein  also  has 
been  shown  to  be  tightly  associated  with  the  MS2  genomic  RNA  (Shiba  and  Suzuki,  1981), 
which  is  important  for  RNA  packing  in  vitro  (Argetsinger,  1966  #63;  Heisenberg,  1966  #64).  In 
addition  to  its  role  in  RNA  packing,  the  A  protein  is  important  for  host  recognition,  attachment, 
and  subsequent  transfer  of  phage  genomic  RNA  into  its  host  (Stockley  et  al.,  1994). 

The  purpose  of  this  study  is  to  extend  the  structural  characterization  of  the  MS2 
phage  by  examining  its  physical  characteristics  in  solution  by  use  of  small  angle  neutron 
scattering  (SANS).  Specifically,  we  are  interested  in  determining  the  spatial  relationship 
between  the  genomic  RNA  to  its  protein  shell. 

The  use  of  SANS  for  determining  molecular  weight  has  been  described  in  detail 
(Jacrot  and  Zaccai,  1981).  In  general,  SANS  is  a  process  where  a  neutron  beam  is  passed 
through  a  sample,  and  the  resulting  scattering  pattern  reveals  information  about  the  average  size, 
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shape,  and  orientation  of  the  sample  (Svergun  and  Koch,  2002;  Glinka  et  al.,  1998).  SANS 
experiments  do  not  cause  radiation  damage  to  the  sample,  and  typical  experiments  can  be  per¬ 
formed  under  physiological  conditions  in  solution.  Also,  when  the  concentration  (or  particle 
number)  of  the  sample  is  known,  then  the  molecular  weight  of  the  sample  can  be  determined  by 
SANS  since  the  data  are  obtained  on  an  absolute  scale  (usually  in  cm'1).  Similarly,  if  the  total 
molecular  weight  of  the  sample  is  known,  then  the  concentration  of  the  particles  in  the  sample 
can  be  determined  (Mazzone,  1998).  A  number  of  phage  and  viral  molecular  weights  have  been 
successfully  determined  by  this  method  (e.g.,  Frog  virus  3  Cuillel  et  al .,  1979),  influenza 
(Cusack  et  al.,  1985),  pfl  phage  (Torbet,  1979),  and  Semiliki  Forest  virus  (Freeman  and 
Leonard,  1980). 

SANS  is  a  powerful  tool  for  structural  analysis  but,  when  combined  with  the 
contrast  variation  method,  it  also  permits  additional  structural  information  to  be  obtained  about 
the  individual  components  in  a  macromolecular  complex.  In  the  case  of  MS2,  the  contrast 
variation  technique  involves  varying  the  solvent  water  to  deuterated  water  ratio  so  that  structural 
information  about  the  protein  and  nucleic  acid  components  can  be  obtained  separately 
(Glinka  et  al.,  1998;  Struhrmann  and  Miller,  1978). 


2.  EXPERIMENTAL  PROCEDURES* 

2.1  Bacteriophage,  Hosts,  and  Medium. 

The  MS2  bacteriophage  strain  15597-B1  and  its  Escherichia  coli  (E.  coli)  host 
15597  were  purchased  from  the  American  Type  Culture  Center  (Manassas,  VA).  E.  coli  strain 
15597  was  grown  on  MS2  broth.  MS2  broth  contains,  per  liter:  10  g  tryptone,  8  g  NaCl,  and  1  g 
Bacto-yeast.  After  autoclaving:  10  mL  of  sterile  10%  glucose,  2  mL  of  1  mol/1  (M)  CaCL,  and 
10  mg/mL  of  thiamine  hydrochloride  were  added  per  liter  (Davis  and  Sinsheimer,  1963).  The 
MS2  was  stored  in  Tris-Salt-Magnesium  (TSM)  buffer  unless  otherwise  stated.  The  TSM  buffer 
contains  10  mM  Tris  (pH  7.0),  100  mM  NaCl,  and  1  mM  MgCL- 

2.2  Growth  of  Bacteriophage. 

The  MS2  phage  was  grown  using  protocols  modified  from  (Sambrook  and 
Russell,  2001)  and  is  described  below.  Fresh  MS2  broth  (3  mL)  was  inoculated  with  400  pi  of 
an  overnight  culture  of  Escherichia  coli  strain  15597  [American  Type  Culture  Center  (Manassas, 
VA)]  at  OD600  =  1  (1  x  109  cells/mL).  The  MS2  bacteriophage  was  added  to  the  inoculant  at  a 
multiplicity  of  infection  of  0.01  to  3  and  incubated  for  20  min  at  37  °C.  The  mixture  was  added 
to  500  mL  of  pre-warmed  MS2  broth  and  incubated  8-12  hr.  Cell  lysis  was  induced  by  adding 
20  mL  of  chloroform  followed  by  shaking  for  10  min  at  37  °C.  Cultures  were  cooled  to  room 


^Certain  commercial  materials,  instruments,  and  equipment  are  identified  in  this  manuscript  to  specify  the 
experimental  procedure  as  completely  as  possible.  In  no  case  does  such  identification  imply  a  recommendation  or 
endorsement  by  the  National  Institute  of  Standards  and  Technology  (NIST)  nor  does  it  imply  that  the  materials, 
instruments,  or  equipment  identified  is  necessarily  the  best  available  for  the  purpose. 
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temperature  and  then  DNAse  I  and  RNAse  were  each  added  to  a  final  concentration  of 
50  mg/mL.  The  cultures  were  incubated  for  30  min  at  room  temperature,  and  then  29.2  g  of 
NaCl  (final  concentration,  1M)  was  added.  The  mixture  was  incubated  for  1  hr  on  ice  and 
centrifiiged  at  107,910  m/s2  (1 1,000  x  g)  for  10  min  at  4  °C.  To  the  supernatant,  45.2  g  of 
ammonium  sulfate  was  added  to  produce  a  20%  (w/w)  saturated  solution  and  incubated  at  4  °C 
for  2  hr.  The  mixture  was  centrifuged  at  1 1,000  x  g  for  30  min  at  4  °C.  Ammonium  sulfate 
(75.2  g)  was  added  to  the  supernatant  to  produce  a  50%  (w/w)  saturated  solution.  Following 
overnight  incubation  at  4  °C,  the  sample  was  centrifuged  at  1 1,000  x  g  for  20  min  at  4  °C.  The 
pellet  was  resuspended  in  30  mL  TSM,  and  1 1.0  g  of  ammonium  sulfate  was  added.  The 
solution  was  incubated  overnight  at  4  °C  and  then  centrifuged  at  1 1,000  x  g  for  20  min  at  4  °C. 
The  pellet  was  resuspended  in  35  mL  TSM  and  centrifuged  at  1 1 ,000  x  g  for  30  min.  For  the 
final  precipitation  step,  the  bacteriophage  mixture  was  incubated  at  4  °C  for  1  hr  or  overnight 
and  centrifuged  at  1 1,000  x  g  for  10  min  at  4  °C.  The  aqueous  phase  contained  the  crude  phage 
particles. 


Purified  MS2  phage  was  isolated  by  cesium  chloride  equilibrium  gradient.  The 
cesium  chloride  protocol  used  was  as  described  by  Sambrook  and  Russell,  2001  with  the 
following  modifications.  Cesium  chloride  was  dissolved  in  TSM  medium.  Ultra-centrifugation 
was  performed  at  23  °C.  Following  centrifugation,  the  samples  were  transferred  to  Slide-a  Lyzer 
(Pierce,  Rockford,  IL)  and  dialyzed  in  500  mL  TSM  for  24  hr  with  2  changes  of  buffer.  The 
measured  density  of  the  MS2  particles  was  1.38  +  0.01  g/cm3,  which  is  the  same  density  value 
reported  by  Strauss  and  Sinsheimer,  1963.  Samples  for  SANS  measurements  were  made  in  TSM 
buffers  containing  0,  10,  65,  85,  and  100%  D2O.  The  samples  were  dialyzed  in  the  appropriate 
buffers  for  2  hr  at  room  temperature,  with  two  changes  of  buffer,  and  then  transferred  to  sample 
holders. 

2.3  SANS  Measurements. 

SANS  measurements  were  performed  on  the  30-m  SANS  instruments  at  the  NIST 
Center  for  Neutron  Research  in  Gaithersburg,  MD  (Glinka  et  al.,  1998).  The  neutron 
wavelength,  X,  was  6 A,  with  a  wavelength  spread,  AX/X,  of  0.15.  Scattered  neutrons  were 
detected  with  a  64  cm  x  64  cm  2-dimensional  position  sensitive  detector  with  128  x  128  pixels. 
Raw  counts  were  normalized  to  a  common  monitor  count  and  corrected  for  empty  cell  counts, 
ambient  room  background  counts  and  non-uniform  detector  response.  Data  were  placed  on  an 
absolute  scale  by  normalizing  the  scattered  intensity  to  the  incident  beam  flux.  Finally,  the  data 
were  radially  averaged  to  produce  scattering  intensity,  I(Q),  versus  Q  curves,  where  Q  = 
47tsin(0)/X,  and  20  is  the  scattering  angle.  Sample-to-detector  distances  of  12  m  and  2.5  m  were 
used  to  cover  the  range  0.005  A'1  <  Q  <  0.1 7A"1.  The  scattered  intensities  from  the  samples  were 
then  further  corrected  for  buffer  scattering  and  incoherent  scattering  from  hydrogen  in  the 
samples. 
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2.4 


SANS  Data  Analysis. 


The  Guinier  approximation,  I(Q)  =  I(0)exp(-Q2Rg2/3),  was  used  on  the  low-Q 
portions  of  the  data  to  obtain  initial  values  for  the  radius  of  gyration,  Rg,  and  the  forward 
scattering  intensity,  1(0),  of  the  samples.  This  analysis  is  valid  only  in  the  region  where 
QRg  ~  1.  The  GNOM  program  (Semenyuk  and  Svergun,  1991),  which  uses  all  of  the  data, 
rather  than  a  limited  data  set  at  small  Q  values,  was  used  to  determine  the  distance  distribution 
function,  P(r);  the  radius  of  gyration,  Rg;  the  forward  scattering  intensity,  1(0);  and  the  maximum 
dimension,  Dmax.  Since  all  of  the  data  are  used,  this  approach  typically  leads  to  more  accurate 
determinations  of  Rg  and  1(0)  that  are  less  influenced  by  possible  aggregation  effects. 

Since  MS2  can  be  approximated  very  well  by  a  spherical  shell  at  the  resolution 
level  of  the  SANS  measurements,  the  data  were  also  fit  to  a  core-shell  sphere  model  (Semenyuk 
and  Svergun,  1991)  to  obtain  the  radius  of  the  protein  shell  and  RNA  core.  The  neutron 
scattering  length  density  of  the  RNA  core  was  an  additional  fitting  parameter  that  allowed  the 
amount  of  water,  versus  RNA,  in  the  core  to  be  calculated  using  eq.  1 : 

PCORE  =  XpRNA  +  ( 1  -X)  PSOLVENT  ( 1 ) 

where  X  is  the  fraction  of  RNA  in  the  core,  pcore  is  the  fitted  scattering  length  density  of  the 
core  portion  of  the  core-shell  model,  and  prna  and  Psolvent  are  the  known  scattering  length 
densities  of  the  RNA  and  the  solvent,  respectively.  The  core-shell  model  fits  consider  the 
resolution  function  of  the  SANS  instruments. 

The  scattered  intensities  from  the  MS2  protein/RNA  complex  were  decomposed 
into  the  scattering  from  their  components,  Iprot(Q)  and  Irna(Q)  using  eq.  2: 

2  2 
I(Q)  =  Approt  Iprot(Q)  +  ApprotAprnaIprotrna(Q)  +  Aprna  Irna(Q)  (2) 

where  Ap  =  (p-ps)  is  the  contrast,  or  the  difference  between  the  scattering  length  density  of  the 
molecule  (p)  and  the  solvent  (ps).  The  cross-term,  Iprotrna(Q),  represents  the  interference 
function  between  the  protein  and  RNA  components.  The  known  quantities  in  eq.  1  are  Approt 
and  Aprna  The  unknowns  are  Iprot(Q),  Irna(Q),  and  Iprotrna(Q).  Since  measurements  were 
made  at  five  different  contrasts,  or  D2O/H2O  buffer  conditions,  there  is  sufficient  information  to 
solve  for  the  three  unknown  component  intensities  from  the  set  of  simultaneous  equations  for 
I(Q)  at  each  contrast. 

The  Mw  values  of  the  protein  and  RNA  components  of  MS2  were  calculated  in  a 
similar  manner  using  eq.  3: 


1(0)  —  n(AppRoiVpROT  +  Aprna  Vrna)2  (3) 

where  n  is  the  number  density  of  MS2  particles,  and  Vprot  and  Vrna  are  the  volumes  of  the 
protein  and  RNA  components,  respectively.  These  volumes  can  be  written  as  V  =  Mw/(  NAd), 
where  d  is  the  mass  density  and  Na  is  Avogadro’s  number.  Now,  eq.  2  can  be  rewritten  as: 
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where  dpROT  =  1-38  g/cm3  and  dRNA  =  1.89  g/cm3.  Now,  there  are  only  two  unknowns,  Approt 
and  ApRNA-  The  1(0)  values  obtained  from  the  GNOM  analysis  of  the  data  for  each  D2O/H2O 
buffer  can  be  used  with  the  measured  number  densities  to  solve  the  set  of  simultaneous  equations 
for  these  two  unknowns  to  obtain  the  Mw  values  for  the  protein  and  RNA  components  separately 
in  the  MS2  complex.  The  total  Mw  value  is  then  simply  the  sum  of  the  two  component  Mw 
values.  It  is  important  to  note  that  1(0)  must  be  on  an  absolute  scale,  usually  in  cm'1,  to  obtain 
accurate  Mw  values  from  eq.  3  or  4. 

2.5  Number  Density  Determinations. 

Number  density  determinations  were  made  using  two  methods:  (1)  the 
concentration  was  measured  by  optical  density  (OD)  using  a  conventional  spectrophotometer 
and  then  the  number  density  was  estimated  using  this  information,  and  (2)  the  number  density 
was  obtained  directly  using  the  Integrated  Virus  Detection  System  (IVDS),  which  is  a  particle 
counting  method.  Measurements  were  obtained  both  before  and  after  dialysis  and  subsequent 
SANS  experiments.  However,  only  the  measurements  taken  after  dialysis  are  used  and  reported 
here. 


72 


|  aA, 


(  I  A  ^  I  \ 

PROT 


v  Na  dp.—.  , 


M  + 

1V1  Wprot  t 


\APk 


v  Na  d  / 


M 


Wrna 


(4) 


The  number  densities  calculated  from  the  OD260  measurements  were  found  from 
the  measured  concentration,  c,  using  n  =  cNa/Mw,  where  Na  is  Avogadro’s  number,  and  Mw  is 
the  total  molecular  weight  of  the  MS2  particle.  Since  n  has  units  of  cm'3,  c  must  be  converted  to 
units  of  g-cm'3.  Sample  concentrations  were  measured  after  the  SANS  experiments  by 
measuring  the  absorbance  at  260  nm  and  using  Beer’s  Law, 

c  =  A26o/e-L,  (5) 

where  £  is  the  molar  coefficient,  and  L  is  the  pathlength  of  the  light,  to  calculate  the 
concentration  (Eisenberg,  1979).  Since  the  molar  coefficient  is  also  dependent  upon  the  total 
Mw  of  the  particle,  this  method  of  determining  the  number  density  is  only  useful  if  the  total  Mw 
of  the  particle  is  known  (Eisenberg,  1979).  Sample  concentrations  were  measured  in  duplicate 
using  a  Hewlett-Packard  model  8450A  spectrophotometer.  The  spectrophotometer  was 
calibrated  using  NIST  transmittance  and  wavelength  Standard  Reference  Material  numbers  930, 
2031,  and  2034. 

The  IVDS  was  used  to  determine  MS2  particle  number  directly.  The  IVDS  is  a 
bipartite  instrument  consisting  of  (1)  an  ultra-filtration  unit  for  use  in  the  purification  and/or 
concentration  of  materials  for  analysis,  and  (2)  a  gas-phase  electrophoretic  mobility  analyzer 
(GEMMA)  detector  for  particle  counting  and  sizing  measurements  (Wick  and  McCubbin, 
1999a).  The  ultra-filtration  unit  has  been  previously  described  (Wick  and  McCubbin,  1999c) 
and  was  not  used  for  these  experiments.  This  work  used  the  GEMMA  detector  system  only. 

The  GEMMA  detector  consists  of  an  electro  spray  that  sprays  the  sample  into  the  detector,  a 
differential  mobility  analyzer  to  separate  the  sample  by  size  and  a  condensation  particle  counter 
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for  particle  counting.  These  components  are  in  a  single  module.  The  complete  IVDS  instrument 
has  been  previously  described  in  detail  and  was  originally  designed  to  detect,  quantify  and  size 
viruses  in  the  10  to  100  nm  size  range  (Wick  and  McCubbin,  1999a;  Wick  and  McCubbin, 
1999b;  Wick  and  McCubbin,  1999c).  The  IVDS  instrument  was  calibrated  using  a  NIST- 
Traceable  Standard  Reference  Material.  The  particle  number  of  samples  for  the  experiments 
described  here  was  determined  using  the  optimal  usage  procedures  and  calibration  conditions 
described  by  Kuzmanovic  (unpublished  data,  2003). 


Table  1.  Parameters  from  Core-Shell  Model  Fit 


Sample 

Rl  (A) 

R2  (A) 

t(A) 

1(0)  cm'1 

Experiment  #1 

0%  d2o 

112 

134 

22 

4.03  ±  0.02 

io%d2o 

84 

141 

57 

2.50  ±  0.02 

65%  D20 

118 

137 

19 

0.46  +  0.01 

85%  D20 

115 

133 

18 

2.10  ±  0.01 

100%  d2o 

115 

139 

24 

6.65  ±  0.05 

Average  Values 

115  ±  1 

136  ±  1 

21  ±  1 

— 

Experiment  #2 

0%  d2o 

110 

136 

26 

5.47  ±0.01 

io%d2o 

95 

139 

44 

4.46  ±  0.01 

65%  D20 

116 

144 

28 

0.79  ±  0.03 

85%  D20 

112 

136 

14 

2.83  ±  0.02 

100%  d2o 

112 

139 

27 

7.94  ±  0.02 

Average  Values* 

"T r- - : - - — : - r 

113  ±  1 

139  ±2 

24  ±3 

— 

The  values  obtained  for  the  1 0%  D20  samples  were  excluded  when  calculating  the  average 


values.  Errors  in  the  average  values  for  R1 ,  R2,  and  t  are  standard  deviations  of  the  mean  and 
do  not  include  unknown  systematic  errors.  Errors  in  1(0)  values  are  standard  deviations  of  the 
mean  from  several  fits  that  were  made  for  each  %D20  condition,  yielding  the  same  values  for 
Rl,  R2,  and  t. 
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Figure  1.  Core-Shell  Model  Fit  for  MS2.  A  sample  core-shell  model  fit,  with  ( — )  and  without 

( - )  correcting  the  model  for  instrumental  resolution  effects,  for  the  100%  D2O  data  (o)  of 

Experiment#!. 


3.  RESULTS 

Two  complete  contrast  variation  series  of  measurements  were  performed  on  two 
different  MS2  sample  preparations  and  are  designated  as  Experiment  #1  and  Experiment  #2.  At 
the  resolution  of  the  SANS  measurements,  the  shape  of  an  MS2  particle  can  be  approximated 
very  well  by  a  spherical  shell,  with  inner  radius,  Rl,  outer  radius,  R2,  and  shell  thickness, 
t  =  R2  -  R1 .  Fits  to  this  core-shell  model  were  made  for  the  data  at  each  contrast,  and  the  results 
are  shown  in  Table  1.  A  sample  model  fit,  made  with  and  without  correcting  the  model  for 
instrumental  resolution  effects,  is  shown  for  the  100%  D2O  data  of  Experiment  #1  in  Figure  1. 
Table  1  shows  that,  in  all  cases,  the  outer  radius  of  the  shell,  R2,  consistently  falls  between  the 
values  of  134A  and  144 A.  The  (core)  inner  radius,  Rl,  falls  between  11 0A  and  1 18 A,  except  for 
the  10%  D2O  buffer  sample,  which  consistently  shows  a  much  smaller  Rl  value  for  both 
experiments.  The  RNA  in  the  core  scatters  strongly  in  comparison  with  the  protein  shell  under 
these  solvent  conditions.  Thus,  at  this  contrast,  the  lower  value  for  Rl  could  be  an  indication 
that  the  RNA  is  actually  packed  compactly  and  does  not  completely  fill  the  core  region,  with  the 
remainder  of  the  core  being  mostly  solvent.  The  amount  of  water  in  the  core  was  calculated 
from  the  fitted  scattering  length  density  of  the  core  region  for  the  data  at  each  contrast  using  eq. 

1.  The  average  fraction  of  water  in  the  core  region  was  0.81  ±  0.04  for  Experiment  #1  and  0.77 
±  0.03  for  Experiment  #2.  If  the  values  obtained  in  10%  D2O  are  excluded,  the  average 
parameters  obtained  from  the  core-shell  model  fit  are  R1  =  115±1,R2=136±1,  and  t  =  21  +  1 
for  Experiment  #1,  and  Rl  =  1 13  ±  1,  R2  =  139  ±  2,  and  t  =  24  ±  3  for  Experiment  #2. 
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Table  2.  Parameters  from  Distance  Distribution  Function  Determination 


Sample 

Rg(A) 

1(0)  cm’1 

Experiment  #1 

0%  D20 

1 14  ±  1 

4.10  ±0.02 

io%d2o 

1 14  ±  1 

2.60  ±  0.02 

65%  D20 

130  ±  1 

0.46  ±  0.07 

85%  D20 

120.0  ±0.05 

2.10  ±0.01 

100%  d2o 

122.0  +  0.05 

6.70  ±  0.01 

Experiment  #2 

0%  D20 

115  +  1 

5.48  ±  0.03 

io%d2o 

1 14  ±  1 

4.48  ±  0.03 

65%  D20 

127+1 

0.77  +  0.01 

85%  D20 

124.0  ±0.05 

2.82  ±  0.02 

100%  D20 

123.0  ±0.05 

7.94  ±  0.02 

Errors  are  standard  deviations  of  the  mean  for  several  equally  good  fits  made  at  each  %D20 
condition. 


Figure  2.  Distance  Distribution  Functions  for  MS2.  Distance  distribution  functions,  P(r)  vs.  r, 

from  the  data  for  samples  in  100%  D2O  ( — ),  85%  D2O  ( - ),  65%  D2O  ( - ), 

10%  D2O  ( - ),  and  0%  D2O  (•  •  •)  from  (a)  Experiment  #1  and  (b)  Experiment  #2. 
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Distance  distribution  functions,  P(r),  were  obtained  from  the  data  as  described  in 
Section  2,  and  the  fitted  parameters  are  listed  in  Table  2.  The  resultant  P(r)  functions  for 
Experiment  #1  are  plotted  in  Figure  2a,  and  those  for  Experiment  #2  are  plotted  in  Figure  2b. 
The  P(r)  functions  are  normalized  so  that  the  peak  value  is  equal  to  1.0  in  each  case.  The 
maximum  distance,  Dmax,  in  all  cases,  was  300A,  which  is  larger  than  2*R2.  By  definition,  Draax 
is  the  distance  at  which  P(r)  goes  to  zero.  Thus,  Dmax  suggests  a  sharp  boundary  between  the 
particle  and  its  surroundings.  Since  the  shape  of  the  MS2  coat  protein  region  is  actually 
icosahedral,  this  boundary  is  not  sharp,  and  the  P(r)  functions  suggest  that  the  particle  does 
actually  extend  beyond  2*R2.  However,  the  number  of  probable  distances  beyond  R2  drops 
sharply. 


For  both  experiments,  the  P(r)  function  for  the  65%  D20  sample  is  consistent  with 
that  of  a  hollow  spherical  shell.  In  this  case,  the  peak  of  the  distance  distribution  is  at  ~200A, 
consistent  with  the  fact  that  the  most  probable  distances  are  occurring  beyond  2*R1.  In  65% 
D20,  the  scattering  length  density  of  the  RNA  component  of  the  complex  is  the  same  as  that  of 
the  solvent.  Thus,  the  scattering  from  the  RNA  is  masked  and  only  the  scattering  from  the 
protein  is  observed.  Therefore,  the  fact  that  MS2  most  closely  resembles  a  hollow  sphere  at  this 
contrast  is  expected. 

On  the  other  hand,  the  peak  of  the  distance  distribution  function  occurs  at  values 
smaller  than  2*R1  for  the  data  obtained  at  the  other  contrasts.  The  scattering  from  the  RNA 
component  is  the  strongest,  relative  to  the  protein  component,  in  the  0%  and  10%  D20 
conditions,  which  is  evident  from  the  fact  that  the  peak  in  P(r)  is  at  the  smallest  r- values  under 
these  conditions.  In  other  words,  the  RNA  component  is  contributing  more  to  the  total  scattering 
in  0%  and  10%  D20,  and  this  is  reflected  as  a  shift  in  the  peak  in  P(r)  to  smaller  r  values.  In 
85%  and  100%  D20,  the  RNA  component  is  contributing  to  the  total  scattering,  but  the 
scattering  from  the  protein  component  is  much  stronger.  Thus,  the  peak  in  P(r)  falls  in  between 
the  65%  D20  case  and  the  0%  and  10%  D20  cases. 


Figure  3.  Distance  Distribution  Functions  for  the  Components  of  MS2.  Distance  distribution 

functions,  P(r)  vs.  r,  for  the  protein  ( — )  and  RNA  ( - )  components  of  the  MS2  particles  from 

Experiment  #1. 
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The  scattered  intensities  from  the  RNA  and  protein  components  were  separated 
from  each  other  using  eq.  2.  The  resultant  P(r)  functions  are  shown  in  Figure  3  for 
Experiment  #1 .  This  process  was  also  attempted  using  the  data  from  Experiment  #2.  However, 
the  results  were  much  noisier,  probably  owing  to  the  poorer  quality  of  the  data  relative  to  the 
data  obtained  in  Experiment  #1 .  The  results  for  Experiment  #1  show  that,  while  Dmax  for  the 
protein  shell  remains  at  300A,  Dmax  for  the  RNA  core  was  165 A.  Thus,  the  RNA  component 
appears  to  be  confined  mostly  within  a  radius  of  ~83A.  The  peak  of  the  RNA  P(r)  distribution  is 
also  around  this  value.  These  results  agree  very  well  with  the  R1  values  from  the  core-shell 
model  fits  for  the  samples  measured  in  10%  D2O  for  Experiment  #1  and  Experiment  #2  (see 
Table  1).  Recall,  that  the  10%  D2O  solvent  condition  is  where  the  RNA  scattering  is  the 
strongest  relative  to  that  of  the  protein. 


4.  DISCUSSION 

The  MS2  bacteriophage  is  a  model  organism  for  a  number  of  important  areas  of 
research  including  viral  replication,  infection,  and  assembly  (Stockley  et  al.,  1994).  Recently, 
noninfectious,  genetically  modified  forms  of  the  MS2  phage  that  contain  varying  amounts  of 
RNA  (compared  to  the  wild-type  phage)  have  been  developed  for  use  as  biological  standards 
(Pasloske  et  al.,  1998;  Stockley  and  Mastico,  2000).  These  commercially  available  recombinant 
particles,  Armored  RNAs,  are  used  as  reference  material  in  research  assays  for  the  HIV;  Ebola; 
Boma;  Hepatitis  A,  C,  and  G;  Dengue;  Enterovirus;  West  Nile;  and  Norwalk  viruses,  among 
others  (Ambion,  personal  communication,  2003). 

The  use  of  these  particles  as  biological  standards  in  public  health  screening  of 
humans  and  livestock  has  been  hampered  by  the  lack  of  rapid  quantitative  methods  to  analyze 
the  physical  properties  of  this  family  of  particles,  not  found  in  nature,  which  cannot  be 
scientifically  characterized  by  traditional  methods.  These  MS2-like  biomarkers,  because  of  their 
small  size  and  the  necessity  that  they  be  noninfectious,  cannot  be  rapidly  or  reliably  counted.  As 
a  result,  this  new  generation  of  biological  reference  material  cannot  be  cheaply  characterized  for 
general  use  in  public  health  laboratories.  This  is  solely  due  to  the  fact  that  their  physical 
properties  in  solution  cannot  be  quantified  or  confirmed.  Thus,  there  is  a  need  for 
instrumentation  that  can  count  biological  particles  about  which  nothing  is  known  and  that  also 
can  provide  structural  information  about  their  properties  in  solution. 

The  creation  of  these  new  forms  of  MS2  has  made  it  increasingly  important  to 
understand  the  relationship  of  the  indigenous  MS2  RNA  to  its  protein  shell  and  to  measure  the 
Mw  of  the  wild-type  RNA  molecule  in  vivo  under  biological  conditions.  Currently,  this  can  only 
be  accomplished  by  combining  small-angle  neutron  scattering  (SANS)  with  a  novel  virus 
counting  instrument,  the  IVDS,  since  there  is  no  other  method  to  rapidly  count  small  (<100  nm) 
biological  particles  with  unknown  properties  in  solution  in  the  absence  of  information  about  the 
particle  molecular  weight  (Mw)  or  infectivity.  This  initial  study,  which  is  the  first  of  a  larger  set 
of  completed  experiments  using  recombinant  biomarkers,  serves  as  a  model  for  the  use  of  SANS 
and  IVDS  as  a  virus  identification  and  characterization  tool. 
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Our  results  show  that  (a)  the  MS2  RNA  is  tightly  compacted  within  the  virion, 
confined  to  a  radius  of  83  ±  1  A,  (b)  the  MS2  coat  protein  shell  extends  to  a  radius  of  136  ±  2 A 
and  has  a  thickness  of  21  ±  1  A.  Significantly,  we  were  able  to  measure  the  spatial  extent  of  the 
protein  shell  and  RNA  core  separately  as  well,  and  to  show  that  the  RNA  core  is  compact.  For 
about  40  years,  researchers  studying  MS2  have  speculated  about  the  nature  of  the  native  MS2 
RNA  in  vivo  (Strauss  and  Sinsheimer,  1963).  Interest  in  this  issue  was  initially  spurred  by  a 
conflict  between  the  ideal  Mw  of  small  RNA  viruses  in  general,  as  deduced  from  the  apparent 
particle  size  from  electron  microscopic  studies  and  the  density  of  the  particle  in  cesium  chloride 
(CsCl),  compared  to  the  experimentally  determined  Mw  as  measured  using  sedimentation 
gradient  experiments  and  the  resulting  associated  Sedvberg  constant  (S20,w).  A  virus  like  MS2, 
with  an  approximate  particle  size  of  23  to  28  nm  based  on  electron  microscopy,  and  with  a 
density  in  CsCl  of  1 .41  g/cm3  would  be  expected  to  have  a  molecular  weight  on  the  order  of 
9.8  x  106  g/mol.  However,  based  on  experimental  sedimentation  data,  the  Mw  of  MS2  was 
determined  to  be  3.6  x  106  g/mol.  This  discrepancy  between  ideal  and  experimental  molecular 
weight  has  been  seen  in  a  variety  of  small  RNA  viruses  (Kaesberg,  1959).  Initially,  this  was 
thought  to  be  due  to  some  unusual  features  related  to  the  interaction  of  CsCl  with  viral  RNA. 
However,  subsequent  experiments  using  deuterated  water  rather  than  cesium  chloride  yielded 
similar  results  (Overby  et  al.,  1966).  The  idea  that  the  core  of  the  virus  RNA  might  contain 
hollow  regions  with  solvent  was  proposed  to  explain  these  discrepancies,  but  this  theory  was  not 
proven  (Overby  et  al.,  1966).  Our  results  show  definitively  that  indeed  the  RNA  is  tightly 
packed  within  the  MS2  protein  shell.  The  amount  of  water  in  the  core  was  calculated  from  the 
fitted  scattering  length  density  of  the  core  region  for  the  data  at  each  contrast.  The  average 
fraction  of  water  in  the  core  region  was  found  experimentally  to  be  0.81  ±  0.04. 

Indirect  genetic  and  biochemical  results  hint  that  a  variety  of  mechanisms  may  act 
in  concert  to  fold  and  compact  the  MS2  genomic  RNA.  The  complete  genome  of  the  MS2  RNA 
has  been  determined  (Fiers  et  al.,  1976).  Sequence  analysis  of  the  nucleotide  sequence  predicts 
that  the  MS2  RNA  spontaneously  folds  into  a  number  of  important  secondary  structures 
(Fiers  et  al.,  1976).  Perhaps  the  most  important  of  these  secondary  motifs  is  a  stem-loop 
structure  found  in  the  MS2  operator  site.  The  MS2  coat  protein  has  been  shown  to  specifically 
bind  the  stem-loop  site  and  to  selectively  regulate  the  virus  assembly  and  transcription  of  the 
MS2  transcriptional  operon  (Stockley  et  al.,  1994).  Co-crystallographic  experiments  involving 
the  empty  MS2  capsid  and  a  RNA  fragment  containing  the  stem-loop  region  of  the  MS2  operator 
site  have  shown  that  the  MS2  coat  protein  and  MS2  genomic  RNA  interact  at  20  different  sites 
along  the  MS2  RNA  backbone. 

The  A  protein  also  has  been  shown  to  directly  bind  the  MS2  genomic  RNA  at  its 
5  ft  and  3  ft  end.  A  number  of  lines  of  evidence  point  to  the  tight  association  between  the 
A  protein  and  the  genomic  RNA.  First,  the  A  protein  and  genomic  RNA  are  selectively  co¬ 
precipitated.  Second,  RNAse  protection  and  competition  experiments  followed  by  sequence 
analysis  of  the  protected  RNA  indicate  that  the  MS2  RNA  is  tightly  bound  by  the  A  protein  at  its 
5  ft  and  3  ft  end  (Shiba  and  Suzuki,  1981).  This  tight  association  between  the  genomic  RNA  and 
the  A  protein  has  been  shown  to  be  important  for  RNA  packing  in  vitro.  Mutant  MS2 
bacteriophage,  which  lack  the  A  protein,  contain  loosely  packed  RNA  that  protrudes  from  the 
MS2  protein  shell  and  then  becomes  degraded  by  nucleases  in  the  media  (Argetsinger  and 
Gussin,  1966;  Heisenberg,  1966).  In  addition,  the  small  polyanion,  spermidine,  is  selectively 
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sequestered  in  the  capsid  core  (Leipold,  1977;  Jacobson  et  al.,  1985).  In  vitro  experiments  using 
isolated  MS2  RNA/A  protein  complexes  have  revealed  that  increased  levels  of  spermidine  lead 
to  increased  compaction  of  the  RNA/A  protein  complex  as  measured  by  its  migration  in 
sedimentation  gradients,  as  well  as  by  its  increased  infectivity  of  the  complex  in  vitro  (Leipold, 
1977). 


Taken  together,  this  body  of  in  vitro  experiments  suggests  strongly  that  the  MS2 
RNA  is  tightly  compacted.  Our  work  is  the  first  study  to  directly  measure  the  spatial  distribution 
of  the  MS2  genomic  RNA  under  indigenous  conditions  and  to  confirm  that  it  is  indeed  compact 
in  vivo 
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